In morphology and lexicography, a lemma (pl.: lemmas or lemmata) is the canonical form,[1] dictionary form, or citation form of a set of word forms.[2] In English, for example, break, breaks, broke, broken and breaking are forms of the same lexeme, with break as the lemma by which they are indexed. Lexeme, in this context, refers to the set of all the inflected or alternating forms in the paradigm of a single word, and lemma refers to the particular form that is chosen by convention to represent the lexeme. Lemmas have special significance in highly inflected languages such as Arabic, Turkish, and Russian. The process of determining the lemma for a given lexeme is called lemmatisation. The lemma can be viewed as the chief of the principal parts, although lemmatisation is at least partly arbitrary.
A minor... problem can arise when the canonical form of the headword, i.e. the form in which it is to be cited, is to be chosen.